Search CORE

80 research outputs found

Protein–Protein Interaction Hotspots Carved into Sequences

Author: Alfonso Valencia
Burkhard Rost
Yanay Ofran
Publication venue: Public Library of Science
Publication date: 01/07/2007
Field of study

Protein–protein interactions, a key to almost any biological process, are mediated by molecular mechanisms that are not entirely clear. The study of these mechanisms often focuses on all residues at protein–protein interfaces. However, only a small subset of all interface residues is actually essential for recognition or binding. Commonly referred to as “hotspots,” these essential residues are defined as residues that impede protein–protein interactions if mutated. While no in silico tool identifies hotspots in unbound chains, numerous prediction methods were designed to identify all the residues in a protein that are likely to be a part of protein–protein interfaces. These methods typically identify successfully only a small fraction of all interface residues. Here, we analyzed the hypothesis that the two subsets correspond (i.e., that in silico methods may predict few residues because they preferentially predict hotspots). We demonstrate that this is indeed the case and that we can therefore predict directly from the sequence of a single protein which residues are interaction hotspots (without knowledge of the interaction partner). Our results suggested that most protein complexes are stabilized by similar basic principles. The ability to accurately and efficiently identify hotspots from sequence enables the annotation and analysis of protein–protein interaction hotspots in entire organisms and thus may benefit function prediction and drug development. The server for prediction is available at http://www.rostlab.org/services/isis

Crossref

Directory of Open Access Journals

PubMed Central

New in protein structure and function annotation: Hotspots, single nucleotide polymorphisms and the 'Deep Web'

Author: Bromberg Yana
Ofran Yanay
Rost Burkhard
Schneider Reinhard
Yachdav Guy
Publication venue
Publication date: 01/01/2009
Field of study

The rapidly increasing quantity of protein sequence data continues to widen the gap between available sequences and annotations. Comparative modeling suggests some aspects of the 3D structures of approximately half of all known proteins; homology- and network-based inferences annotate some aspect of function for a similar fraction of the proteome. For most known protein sequences, however, there is detailed knowledge about neither their function nor their structure. Comprehensive efforts towards the expert curation of sequence annotations have failed to meet the demand of the rapidly increasing number of available sequences. Only the automated prediction of protein function in the absence of homology can close the gap between available sequences and annotations in the foreseeable future. This review focuses on two novel methods for automated annotation, and briefly presents an outlook on how modern web software may revolutionize the field of protein sequence annotation. First, predictions of protein binding sites and functional hotspots, and the evolution of these into the most successful type of prediction of protein function from sequence will be discussed. Second, a new tool, comprehensive in silico mutagenesis, which contributes important novel predictions of function and at the same time prepares for the onset of the next sequencing revolution, will be described. While these two new sub-fields of protein prediction represent the breakthroughs that have been achieved methodologically, it will then be argued that a different development might further change the way biomedical researchers benefit from annotations: modern web software can connect the worldwide web in any browser with the 'Deep Web' (ie, proprietary data resources). The availability of this direct connection, and the resulting access to a wealth of data, may impact drug discovery and development more than any existing method that contributes to protein annotation

Open Repository and Bibliography - Luxembourg

Unveiling Protein Functions through the Dynamics of the Interaction Network

Author: AC Gavin
AH Tong
B Rost
B Schwikowski
C von Mering
CH Yeang
D Bu
D Lee
D Li
Daqing Li
H Hishigaki
HW Mewes
Inmaculada Leyva
Irene Sendiña–Nadal
JA Almendral
Javier M. Buldú
JF Rual
JM Stuart
Juan A. Almendral
M Ashburner
M Deng
M Punta
P Uetz
PZ Hu
R Kelley
R Sharan
RJ Cho
S Boccaletti
S Letovsky
SH Yook
Shlomo Havlin
Stefano Boccaletti
T Ito
TR Hughes
U Alon
U Karaoz
Y Ho
Yamir Moreno
Yanay Ofran
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Protein interaction networks have become a tool to study biological processes, either for predicting molecular functions or for designing proper new drugs to regulate the main biological interactions. Furthermore, such networks are known to be organized in sub-networks of proteins contributing to the same cellular function. However, the protein function prediction is not accurate and each protein has traditionally been assigned to only one function by the network formalism. By considering the network of the physical interactions between proteins of the yeast together with a manual and single functional classification scheme, we introduce a method able to reveal important information on protein function, at both micro- and macro-scale. In particular, the inspection of the properties of oscillatory dynamics on top of the protein interaction network leads to the identification of misclassification problems in protein function assignments, as well as to unveil correct identification of protein functions. We also demonstrate that our approach can give a network representation of the meta-organization of biological processes by unraveling the interactions between different functional classes

Public Library of Science (PLOS)

CiteSeerX

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

PubMed Central

PUblication MAnagement

Archivo Digital UPM

Analysis of temporal transcription expression profiles reveal links between protein function and developmental stages of Drosophila melanogaster

Author: A Singhania
A Sokolov
AE Lobley
B Marita
BR Graveley
Cen Wan
CF Wu
Christine A. Orengo
D Barrell
D Cozzetto
D Cozzetto
D Cozzetto
D Fristrom
D Sutherland
David T. Jones
DJ Montell
F Minneci
Federico Minneci
I Ezkurdia
J Friedman
JB Weiss
JC Costello
JG Lees
Jonathan G. Lees
JW Truman
L Breiman
L Lan
M Ashburner
M Friedrich
NK Cho
P Radivojac
P Tomancak
R Cagan
S Hunter
S Roy
SD Hooper
T Bossing
T Chang
T Cover
T Kojima
TR Li
VR Chintapalli
Yanay Ofran
YX Jiang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/10/2017
Field of study

Accurate gene or protein function prediction is a key challenge in the post-genome era. Most current methods perform well on molecular function prediction, but struggle to provide useful annotations relating to biological process functions due to the limited power of sequence-based features in that functional domain. In this work, we systematically evaluate the predictive power of temporal transcription expression profiles for protein function prediction in Drosophila melanogaster. Our results show significantly better performance on predicting protein function when transcription expression profile-based features are integrated with sequence-derived features, compared with the sequence-derived features alone. We also observe that the combination of expression-based and sequence-based features leads to further improvement of accuracy on predicting all three domains of gene function. Based on the optimal feature combinations, we then propose a novel multi-classifier-based function prediction method for Drosophila melanogaster proteins, FFPred-fly+. Interpreting our machine learning models also allows us to identify some of the underlying links between biological processes and developmental stages of Drosophila melanogaster

Crossref

Directory of Open Access Journals

UCL Discovery

Birkbeck Institutional Research Online

Neighbor Overlap Is Enriched in the Yeast Interaction Network: Analysis and Implications

Author: A Wagner
AC Gavin
AH Tong
Anna Tramontano
Ariel Feiglin
B Errede
Byungkook Lee
C Lin
CJ Roberts
DE Levin
E Ravasz
E Zotenko
F Radicchi
G Giaever
H Frohlich
H Jeong
I Ulitsky
I Wapinski
J Xiang
JG Cook
John Moult
K Irie
K Ohkuni
K Tedford
KA Olson
L Bardwell
L Grassi
M Jimenez-Sanchez
M Kellis
M Kupiec
M Schuldiner
M Soler
MC Gustin
MF Manolson
MP Samanta
NJ Krogan
P Rice
R Albert
R Kafri
R Kelley
R Milo
R Milo
Ron Unger
S Maslov
S Pu
S Sun
SR Collins
SR Collins
T Roemer
V Cherkasova
V Spirin
X He
Y Artzy-Randrup
Y Guan
Yanay Ofran
Publication venue: Public Library of Science
Publication date: 26/06/2012
Field of study

The yeast protein-protein interaction network has been shown to have distinct topological features such as a scale free degree distribution and a high level of clustering. Here we analyze an additional feature which is called Neighbor Overlap. This feature reflects the number of shared neighbors between a pair of proteins. We show that Neighbor Overlap is enriched in the yeast protein-protein interaction network compared with control networks carefully designed to match the characteristics of the yeast network in terms of degree distribution and clustering coefficient. Our analysis also reveals that pairs of proteins with high Neighbor Overlap have higher sequence similarity, more similar GO annotations and stronger genetic interactions than pairs with low ones. Finally, we demonstrate that pairs of proteins with redundant functions tend to have high Neighbor Overlap. We suggest that a combination of three mechanisms is the basis for this feature: The abundance of protein complexes, selection for backup of function, and the need to allow functional variation

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Network-Based Elucidation of Human Disease Similarities Reveals Common Functional Modules Enriched for Pluripotent Drug Targets

Author: AJ Butte
AL Hopkins
Annie P. Chiang
Atul J. Butte
BP Sokolov
DR Rhodes
DS Wishart
F Parrish
G Hu
H Mi
H Scherk
J Amberger
J Dudley
J Fowler
JF Rual
Joel T. Dudley
JT Dudley
JY Park
K Lage
KG Becker
KI Goh
KK Divine
MA van Driel
MA Yildirim
MP Vawter
O Bodenreider
P Shannon
PD Coleman
PD Stenson
R Chen
R Kalaria
R Sharan
Rong Chen
S Mathivanan
S Shimohama
S Suthram
S Suthram
S Zienolddiny
SH Brown
SH Kim
Silpa Suthram
T Barrett
Trevor J. Hastie
TS Keshava Prasad
U Stelzl
W Huang da
X Wu
Y Tao
Yanay Ofran
YI Liu
YT Jeon
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Current work in elucidating relationships between diseases has largely been based on pre-existing knowledge of disease genes. Consequently, these studies are limited in their discovery of new and unknown disease relationships. We present the first quantitative framework to compare and contrast diseases by an integrated analysis of disease-related mRNA expression data and the human protein interaction network. We identified 4,620 functional modules in the human protein network and provided a quantitative metric to record their responses in 54 diseases leading to 138 significant similarities between diseases. Fourteen of the significant disease correlations also shared common drugs, supporting the hypothesis that similar diseases can be treated by the same drugs, allowing us to make predictions for new uses of existing drugs. Finally, we also identified 59 modules that were dysregulated in at least half of the diseases, representing a common disease-state “signature”. These modules were significantly enriched for genes that are known to be drug targets. Interestingly, drugs known to target these genes/proteins are already known to treat significantly more diseases than drugs targeting other genes/proteins, highlighting the importance of these core modules as prime therapeutic opportunities

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Exploring the Evolution of Novel Enzyme Functions within Structurally Defined Protein Superfamilies

Author: A Andreeva
AE Todd
AL Cuff
Alison L. Cuff
AU Tamuri
BE Engelhardt
BH Dessailly
C Chothia
CA Orengo
Christine A. Orengo
DA Benson
DE Almonacid
DM Schmidt
DS Tawfik
G Caetano-Anolles
GA Reeves
Gemma L. Holliday
GJ Bartlett
GJ Binford
GL Holliday
GL Holliday
GL Holliday
HS Park
I Nobeli
Ian Sillitoe
J Ruan
J Shi
Janet M. Thornton
JP Overington
K Katoh
LH Greene
M Bashton
M Groll
M Xu
ME Glasner
MT Murakami
N Furnham
N Gallastegui
Nicholas Furnham
NJ Mulder
O Khersonsky
PF Gherardini
PJ O'Brien
Roman A. Laskowski
SC Pegg
SD Brown
SF Altschul
W Heinemeyer
WS Valdar
Yanay Ofran
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

In order to understand the evolution of enzyme reactions and to gain an overview of biological catalysis we have combined sequence and structural data to generate phylogenetic trees in an analysis of 276 structurally defined enzyme superfamilies, and used these to study how enzyme functions have evolved. We describe in detail the analysis of two superfamilies to illustrate different paradigms of enzyme evolution. Gathering together data from all the superfamilies supports and develops the observation that they have all evolved to act on a diverse set of substrates, whilst the evolution of new chemistry is much less common. Despite that, by bringing together so much data, we can provide a comprehensive overview of the most common and rare types of changes in function. Our analysis demonstrates on a larger scale than previously studied, that modifications in overall chemistry still occur, with all possible changes at the primary level of the Enzyme Commission (E.C.) classification observed to a greater or lesser extent. The phylogenetic trees map out the evolutionary route taken within a superfamily, as well as all the possible changes within a superfamily. This has been used to generate a matrix of observed exchanges from one enzyme function to another, revealing the scale and nature of enzyme evolution and that some types of exchanges between and within E.C. classes are more prevalent than others. Surprisingly a large proportion (71%) of all known enzyme functions are performed by this relatively small set of 276 superfamilies. This reinforces the hypothesis that relatively few ancient enzymatic domain superfamilies were progenitors for most of the chemistry required for life

Public Library of Science (PLOS)

CiteSeerX

Crossref

LSHTM Research Online

Directory of Open Access Journals

PubMed Central

UCL Discovery

FigShare

The Rough Guide to In Silico Function Prediction, or How To Use Sequence and Structure Information To Predict Protein Function

Author: A Armon
A Bateman
A Godzik
A Passerini
A Pierleoni
A Stark
AE Todd
AT Laurie
B Rost
B Rost
BA Shoemaker
C Notredame
CA Innis
CA Orengo
CA Wilson
CE Stebbins
CJ Jeffery
CJ Jeffery
CP Ponting
CT Porter
D Brown
D Desveaux
D Devos
D Pal
D Petrey
D Petrey
E Krissinel
E Reynolds
EP Gianchandani
F Corpet
F Ferron
F Zhou
Fran Lewitter
G Theissen
GJ Bartlett
GJ Kleywegt
GL Holliday
H Nakashima
HL Schubert
HM Berman
IM Wallace
J Hawkins
J Thompson
JA Barker
JB Bard
JC Whisstock
JG Henikoff
JM Thornton
JS Sodhi
JW Torrance
JZ Wang
K Goyal
K Hofmann
K Karplus
K Nakai
L Holm
L Jaroszewski
L Shapiro
L Wang
LJ Jensen
M Babor
M Gruber
M Linial
M Lippi
M Nayal
M Remm
Marco Punta
MJ Hartshorn
O Emanuelsson
O Lichtarge
OA Bateman
OC Redfern
P Puntervoll
PD Thomas
R Apweiler
R Kolodny
R Nair
R Nair
R Nair
RA Laskowski
RL Tatusov
S Altschul
S Shazman
SG Lee
T Gabaldon
TA Binkowski
TJ Hubbard
TK Attwood
VA Ivanisenko
W Humphrey
W Tian
Y Ofran
Y Ye
Yanay Ofran
Publication venue: Public Library of Science
Publication date: 01/10/2008
Field of study

Crossref

Directory of Open Access Journals

PubMed Central

Integrative Features of the Yeast Phosphoproteome and Protein–Protein Interaction Map

Author: A Chi
AJ Walhout
AJ Walhout
AK Dunker
AL Barabasi
BF Cravatt
C Haynes
C von Mering
CR Landry
CS Tan
CS Tan
D Fiedler
DB Murray
DJ Watts
EL Hong
ES Witze
F Diella
F Gnad
G Manning
H Imamura
H Molina
H Yu
IW Taylor
J Gsponer
J Ivanic
J Ptacek
J Rappsilber
J Rappsilber
J Villen
JD Han
JF Rual
JR Newman
JV Olsen
JV Olsen
K Shimizu
K Tarassov
KI Goh
L Giot
L Salwinski
LJ Holt
M Girvan
Masaru Tomita
N Sugiyama
N Yachie
Naoyuki Sugiyama
Nozomu Yachie
P Uetz
PH Huang
R Aebersold
R Linding
Rintaro Saito
S Li
S Maslov
SA Beausoleil
SB Ficarro
T Hunter
T Ito
T Masuda
T Pawson
W Gong
Y Ho
Y Ishihama
Y Kyono
Yanay Ofran
Yasushi Ishihama
Z Dosztanyi
Publication venue: Public Library of Science
Publication date: 27/01/2011
Field of study

Following recent advances in high-throughput mass spectrometry (MS)–based proteomics, the numbers of identified phosphoproteins and their phosphosites have greatly increased in a wide variety of organisms. Although a critical role of phosphorylation is control of protein signaling, our understanding of the phosphoproteome remains limited. Here, we report unexpected, large-scale connections revealed between the phosphoproteome and protein interactome by integrative data-mining of yeast multi-omics data. First, new phosphoproteome data on yeast cells were obtained by MS-based proteomics and unified with publicly available yeast phosphoproteome data. This revealed that nearly 60% of ∼6,000 yeast genes encode phosphoproteins. We mapped these unified phosphoproteome data on a yeast protein–protein interaction (PPI) network with other yeast multi-omics datasets containing information about proteome abundance, proteome disorders, literature-derived signaling reactomes, and in vitro substratomes of kinases. In the phospho-PPI, phosphoproteins had more interacting partners than nonphosphoproteins, implying that a large fraction of intracellular protein interaction patterns (including those of protein complex formation) is affected by reversible and alternative phosphorylation reactions. Although highly abundant or unstructured proteins have a high chance of both interacting with other proteins and being phosphorylated within cells, the difference between the number counts of interacting partners of phosphoproteins and nonphosphoproteins was significant independently of protein abundance and disorder level. Moreover, analysis of the phospho-PPI and yeast signaling reactome data suggested that co-phosphorylation of interacting proteins by single kinases is common within cells. These multi-omics analyses illuminate how wide-ranging intracellular phosphorylation events and the diversity of physical protein interactions are largely affected by each other

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central